DOCUMENT RESUME 



ED 051 270 



T« 000 584 



AUTHOR 
TITLE 
PUB DATE 
NOTE 



EDRS PRICE 
DESCRIPTORS 



IDENTIFIERS 



Nigro, George A. 

A Strategy to Determine Causal Direc 
Jan 71 

9p. ; Paper presented in the Session: 
Applications, Part II, at the Annual 
the National Council on Measurement 
New /ork, New York, February 1971 

EDRS Price MF-S0.65 HC-J3.29 
♦Correia ion. Educational Research, 
Testing, ♦Mathematical Models, *Mult 
Analysis, ^Research Methodology, *st 
Ana lysis 

Beta Index, *Causal Theory 



AES iR ACT 



A set of mathematical consistencies 
conditions of inequality in a theorem is summarized. 



for 


i ts appl i 


cation with real 


da 


ta is 


present 


ed. The 


st ra 


tegy ace 


suggested for in 


med 


ia te 


use by the pract 


ca us 


e-ef f ect 


relationships in 


a 


syste 


m of var 


iables t 


work 


and time 


in analysis and 


i n 


ter pre ta tion. 


especia 


the 


masses of 


data dealt with 


in 


the 


largely 


nonexper 


cerr 


e lat iona I 


world of educat 


ion 


. See 


also ED 


048 362 



t ions. 

Statist ical 
Convention of 
in Education, 



Hypothesis 
iple Regression 
atis t ical 



that f 


orms 


and a 


strategy 


theore 


m and 


itioner seeking 


o cut 


down guess 


lly in 


light of 


imenta 


1 or 


. <Aut 


ho r/AG) 




o 
f - 

fV? 

U^\ 

O 

o 

LI.-' 



US DE P AT T M E NT OF EDUCATION 

ft WELFARE 

OFFICE Of EOU CATION 
THIS DOCUMENT h AS BEEN REPRODUCED 
EXACTLY AS RECEIVED F RO M THE PERSON OR 
ORGANIZATION ORIGINATING IT FOINTS OF 
VIEW OR OPINIONS STATEO 00 NOT NECES 
SARI' Y REPRESENT OFFICIAL OFFICE OF EDO 
CATION POSITION OR POLICY 



A STRATEGY 

TO EETERMINE OAUSAS EJECTIONS 



OEOROb N1GR0 
Bristol Community Colima 
Kail Rlvar, Maaaachuaatta 02720 
January 1971 

’> K f 




Presented in tho Session I Statistical Application a, Part II 
National Council on Measurement in Education 
At tha Annual Convention 
Nov York City, Nay York 
T obruary 7» 1971 



1 



A STRATH}! TO DETERMINE CAUS4L Hi RECTI 3 



ABSTRACT 

TWL s paper gumma:! fces a sot of mathematical consistencies that form 
conditions of inequality in a thooren and presents a strategy for its application 
vlth real data* as derived from two studies by the author* The theorem and strategy 
aie suggested for immediate me by the practitioner seeking cause-effect rela- 
tionships in a system of variables, to cut down guoss work and time in analysis 
and interpretation, especially in light of the requirement to handle masse* of 
data in a largely nonexperimental or correlational world of education. 
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A STRATEGY TO DETERMINE CAUSJIL ECREOTIONS 



George A. Nlgro 
Bristol Community College 

A paper presented at the 1969 AERA Annual Meeting 1 exposed a set of in- 
equalities baaed on the development of a "beta index#* This index was proposed 
as e confirming device for ceueal directions of three-variebl^ patha in a stepped- 
regreesion and correlation analyst e from dummy deta. These iuequs litie a vore 
completely consistent for causal directions but were inconsistent foi acausal 
direotione. The set was generalized to four mathematical statements of asymmetry 
and proposed an a theorem for a follow-up paper at the 1971 AERA Annual Meeting* 
which was completed in 1970. The latter study repeated *he previous one with one 
important difference* that horl zontrl rather than normal distributions were gene- 
rated and substituted in.o recursive equations to obtain variables for analysis. 
The results of both scudies were tho same. During the course of the latter study 
and reinforced by a revisit to the previous one, a strategy seemed to emerge for 
determining causal directions as well as confirming hypothesi ze d directions; 
that is, regardless of causal directions hypothesi zed, it appeared possible to 
generate hypotheses. 

Hence* the purpose of this paper is to present such strategy along with 
a converse of the theorem in revised form. 



The educational practitioner often labels his dependent variables as 
effects in studies, but he often speaks softly in calling his independent 
viriables Gauges; in fact, he does not dare! If only he had g strategy of 
determining how his variables operate in a system he chooses, mathematically 
as well as logically* he would not only find support for his contentions but 
also cut dovn guess work and time in analysts and interpretation. Because eAica- 
tior is largely a nonexperimental or correlational world and is more and more 
immersed in masses of data to handle* it would be to the educator’s advantage 
to have such a strategy, 

Blalock^ pulled together materials on causal models* but many years later 
caution is still extreme to accept his strategy despite increasing suggestions 
to borrow from fields outside of education# Spring boar ding flora Blalock* the 
investigator in developing a causal model stumbled into what ho named the bst» 
constancy prinoiplo t the beta coefficient of an intermediate variable in a 
causal direction remains relatively constant as other system variables ure intro- 
duced and controlled in stepped regression* whereas that in the acsusal.di recti on 
changes noticeably, VI th an assist from Blalock* and Driver and Massey 1 *, the prlft- 
oiplt served as the seed for the theorem and strategy# The assist* basically* 
was that the magnitude of the correlations should be smaller between variables 
furthest removed from one another in causal sequance. There concepts are 
epitomized in Figure 1 below for a three-VBrlabl e path and in conditions 1 through 
k of the theorem. 

In Figure 1* if variables A, B, end C are related cauaally, and if other 
system variables K, as appropriate* are controlled, the rs among the variables 
would be in ascending order of magnitude r ac «w r ab*k> ^bc^k* Also* would 

rerraln relatively constant* though slightly changed, as other variables, variable 
A neinly, were stepped into regression* whereas would change noticeably# 
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Figure 1* Causal Erection A-B-C 



THEOREM FOR CAUSE- EFFECT RKLATI ON SHi PS 



In a three-variable path coned sting of variables A, 3, and C, with control 
for other ay stem Variables K, if 



! d ^ct..k/J < | d/ 3 a b.>/cl 

where a and o are stepped into regression to obtain beta diff ©r onti ol s, 

l^ca*kbl i^ic'kbl * 



I r ac • k I c 
! r ac *kb 



r ab*kl < 
l r ab*kcl 



l^bck! i 
i 

I ^bo*k* 



( 1 ) 

( 2 ) 

(5) 

W 



where k for the fir at members of the last two conditions are logically 
derived, then variable A 5 a the initial cause, B the interne? iate 3 and 0 
the final effect# ~ 



STRATH! Y FOR APPLICATION OF THE THEOREM 

1# Assume a set of causal directions in c hypothesized system of variables, 
then coldly set aside the hypotheses and also disregard intuitive guides 
until logical analysis xs called for# 

2. Systematically prepare matrices of the following statistics# 
a# Correlations, both zero-order and partial g« 

b. Beta coef fici ent s of all regression equations possible in the ^rsteo# 
c# Multiple correlations and variances of all possible combinations. 

J. Compare blocks of correlations from step 2a by triads, with conditions 5 
and A as guides, then make tentative decisions as to causal directions# 

If all appropriate .-n tries yield the same causal directions, hold decifdon 
for later analysis with betas# If one of the several triads is inconsistent 
with directions among other triads, any causal direction should be rejected# 

A# Repeat step i or each combination of triads in the system. 
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PrdpB.ro regression equations to find beta differentials and beta coefficients 
for end variables* as taken from step 2b* for one seleoted three-variable path# 

a* Pick a zero-order equation and record the beta coefficient on the tentative 
intermediate variable* then step in tne end variable of the three-vari abl e 
path selected and record both beta coefficients* a rraw beta on the inter- 
mediate variable and one for the end variable stepped in* 

b. Repeat step 5* for the opposite direotion* 

c* Find the algebraio difference between the two betas of the pairs of 

equations; this step produces two beta differentials for a given threo- 
variable path for opposite directions. Also produced are end-yari abl e betas* 

d* Co.’npsje the heta differentials. For condition 1, make a tentative 
decision as to causal direction* 

e. Compare beta coefficients of end variables stepped in* For condition 2, 
make a tentative decision as to causal direction. 

f . Repeat steps 5* through 5e vith an added variable K of the system prior 
to stepping the end variable of the three-variabl e path into regression. 

Any variable K is held constant but not considered for either beta- 
differential or beta-coefficient comparison* 

g. Repeat step uncil all variables X are stepped in, only one at a time. 

V . If all compari sons yiel d the same causal directions, compare the oversill 
decision as to causal direction to the deoiaion of step J* If both are 
the same as to causal direction* record the decision for later logical 
analysis. If both are inconsistent, any causal direction should be 
rejected. If compari sion of beta differentials and beta coefficients 
la inconsistent vith directions among others* any causal direotion should 
be rejected* Comparisons are of magnitudes only. 

Repeat step 5 f° r ®* c h three-variable path, until all threo-variabl e paths 
have been studied* 

Prepare a diagram from decisions made, shewing connections among variables 
with arrowhead leaders to show directions of cause-effect. If any link shows 
mutual Causation, arrows in both directions! then, but only then, return to 
the correlations for a logical analysis; this may be the case when the three 
variables are confounded. 

s. In the logical analysis* first con si deration should be given to the fact 
that both conditions 5 arid A are compound inequalities, each consisting 
of three simple inequalities. Return to the information of step J ar.d 
take any two of the three block* of correlations; thes* will already have 
supplied a definite order of relations for all entries* If there is a 
oorrelation entry of the third colucn vhloh Is less than the lesser of 
the other two colunns* inspect the diagram to determine whether it supplies 
a possible relationship guoh as the Driver and Kassey prinoiple; that is, 
the relationship between variables furthest removed from one another, in 
causal sequence* ii the least of all relationships* 
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b # Repeat step 7 a until all links are unidirectional, if any link at all, 

8. Check patterns of the diagram prepared in step 7 and determine the appro- 
priate regression equations for dependent variables. Be cautious not to 
include system variables which are not directly related or lndlreotly 
related through causal directions. 

9 . find total variances accounted for by regression equations of step 6 from 
step 2 o, then find tb > proportions contributed by each independent variable 
of the equations. The total from partitions and that from the table of 
step 2c should be equal with only rounding error. If the total variance 
accounted for in any equation i# low, then poseibly one should introduce 
new variables into the system and repeat the en + lre process. The caution 
of step 8 , if heeded, should preclude adding falec variance just for the 
sake of variance accounting; it should also preclude variance , cauged > in 
independent variables at the beginning of a causal chain. 

10. Compare the diagram of step 7 to that hypothesized snd set aside in step 1, 
Chances are, they are similar. Those directions which were found by the 
strategy should fall into two categories. 

a. If the directions found are the game as hypothesized, the thoorem 
confirms the \y~po theses. 

b. If the direction* found were not originally hypothesized, the theorem 
determine s new hypotheses. 



APPLICATION OF STRATEGY 

The following data have been extracted froa the most recent study^ for steps 
5 through 6 . Note that Tables 1 and 2 exemplify conditions J-4 arid 1 — 2 , 
re gpecti vely . 



Table 1. Correlation Comparisons for Variable* 2, 4, and 5> with Controls* 



Control 


r 25- 


r 24. 




Di reotion 0 




For Condition 






7566 


6066 


8850 


2 - 5-4 


i 


8250 


7410 


8754 


2-5-4 


5 


6559 


5805 


7919 


2-5-4 


15 


7266 


6119 


8251 


2-5-4 




For Condition 4® 




X 


5974 


2067 


81 65 


2-5-4 


X] 


5451 


0688 


6959 


2-5-4 




5669 


2571 


7699 


2-5-4 


xl5 


49 65 


0555 


69 65 


2-5-4 



a Decimal points precede values listed. b For partial* of irs. 

c From comparison of abirlut* values, d Control only for variable* K. 

e Control for third variable In path as well as for variable* K; hence, used. 
♦ At .If of 200, tqy is greater than or equal to . 181 A 
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Direction 2-5-4 is the tentative decision for condi tion* 5 and 4 from the 
correlation study* Note that r 25 la less than rp^, rpr la lees than r 4 ^, and 
rp^ la lesa than r^ f important ior later logical analysis. 

Regression equations in standard form were next studied for path 254, 
only those beta coefficients entered aa necessary in the table, for variable* 
in the thr ee-variabl e path 254. 



Table 2, Beta Studies for Path 254* 



1 


Direction 2-5-4^ 


Direction 4-5-2° 


Direction** 


Other X^ 


X 2 d $^5 


d .625 x 5 x 4 


3 d 3 




$850 


7538 - 




- 


-1489 9959 1129 


25 " 1.0119 -2869 


2-5-4 2-5-4 


1 


7847 


8898 - 




1 


0489 7411 04j6 


0760 81 58 0989 


2 - 5-4 2-5-4 


5 


7576 


8958 




5 


-1769 3807 1251 


28J 0 9738 -5755 


2 - 5-4 2-5-4 


1, 5 


7810 


6888 - 




1, 5 


0288 7415 0197 


05 .*. 5 6551 044C 


2-5-4 2-5-4 



a Decimal points precede values listed eecept where actually shown, 
b Variable 4 is predicted. c Variable 2 is predicted, 

d From comparison of absolute values, e Variables in regression oquationa. 



Again direction 2-5-4 is the deoiden; conditions 1 and 2 are satisfied. 

Variables 2, 4, and 5 wore purposely selected for presentation here, because 
direction 2-4-5* not 2-5-4, was the designed causal path in the last atudy^. 
Obviously, the variables were confounded; but notice how consistent the patterns 
are. Conditions 5 arid 4 were revisited to separate these into three rimple 
inequalities each, as noted above. Some of the entries in the first block of 
of correlations, rp^, are obviously lover in value than pome in the second block, 
r 24 *j all of whose values are less than corresponding values of the third block, 
r^ej. But It was also found that corapari aor . s of betas and beta differentials for 
condition!, 1 and 2 consistently produced directions 2-4-5 and 4-2-5, in tables 
similar to Table 2 above. (No wonder the word "confounded" has an emotional 
stigm*. attached to Lt not unlike Angl 0 saxon e so l ) 

Returning to the model, repeated in Figure 2 below, then again to the 
correlation matrix for a logical analysis, it appeared that to satisfy the C'»beca 
best, three inequalities would have to be true; but only one could be truej 



For 


direction 2-5-4, 


l r 24.l 


l< 


l r 25-4 


<6 1 r 45* 2 1 • 


(5) 


For 


direction 2-4-5, 


l r 25-4 


l< 


1 **24 - 1 ' 


1 **45 • 2 1 • 


(6) 


For 


direction 4-2-5, 


| r 45-2 


Ic 


| r 24-l 


1 | r 25-4 1 • 


(7) 



Checking the values in Table 1 above, only Inequality 6 is true. This atep ia 
admittedly pragmatic, but on approaching logic from the foregoing math base it is 
considered leas so than groping in the dark from personal a priori biaa. 

ERIC 
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The matter of choosing which of the system variable s to control brought 
about much in the way of correlation chess garring, but nevertheless direction 

was the only one of the three considered above that could not be rejeoted. 
But the theorem needs revision, to consider conditions 5 and 4 more flexibly by 
dropping the first member of the two compound inequalities and possibly to add a 
i if th condition such that 

l r ic-k'l< I’Yvk*! < | r bc-*"! » 

where variables kP follow from logical analysis* 9uch revision remains for 
further study. 




Figure 2 Causal Model of Five Variables^ 
(See Footnote.) 



Also found in the previous study by the theorem was direction l-A-5i as 
expected, with no confounding whatsoever; no other directions were found causal. 
A return to the first study revealed the same results about the several paths. 

The rest is left as an exercise to students of etati stic b— - li terally hours 
and lays. 



nsnssicN and conclusion 

Yhether or not hypotheses sre made, the strategy to determine causal 
directions in a system of variables may reveal both relationships and directions. 
If no directions can be found, then a search for other variables to be brought 
into the system should be made. Obviously a system of relationships with causal 
directions seems most significant. SuchShtf stem may then serve as a m^dol to 
conduct an experiment in curriculum, administration, tests, and whatnot or at 
least further survey resecr^.h and evaluation of practices. But the theorem taus^ 
be tried with real data maiy times and possibly refined to be completely accepted. 
For the present more co p-iter time rust be logged. 
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FOOTNOTE 

The tvo studios, references 1 and 2, were completed under the Title IV 
fellowship program, ESEA, PL 89-10 {directed by Dr. John J. Valah). The 
model of Figure 2 was used in both studies with the one exception as noted 
(originally suggested by Dr. Ronald L. Nvttall, also of Boston College). 
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